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HYDRAS: DIRECTED HYPERGRAPHS AND HORN FORMULAS 
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Abstract. We introduce a new graph parameter, the hydra number , arising 
from the minimization problem for Horn formulas in propositional logic. The 
hydra number of a graph G = ( V , E) is the minimal number of hyperarcs of 
the form u,v —>• w required in a directed hypergraph H = (V,F), such that 
for every pair (u,v), the set of vertices reachable in H from {u,u} is the entire 
vertex set V if ( u , v) E E, and it is { u , u} otherwise. Here reachability is 
defined by forward chaining, a standard marking algorithm. 

Various bounds are given for the hydra number. We show that the hydra 
number of a graph can be upper bounded by the number of edges plus the 
path cover number of the line graph of a spanning subgraph, which is a sharp 
bound in several cases. On the other hand, we construct single-headed graphs 
for which that bound is off by a constant factor. Furthermore, we characterize 
trees with low hydra number, and give a lower bound for the hydra number of 
trees based on the number of vertices that are leaves in the tree obtained from 
T by deleting its leaves. This bound is sharp for some families of trees. We 
give bounds for the hydra number of complete binary trees and also discuss a 
related minimization problem. 


1. Introduction 

We consider a problem concerning the minimal number of hyperarcs in directed 
hypergraphs with prescribed reachability properties. In this paper a directed hy¬ 
pergraph H = (V, F ) has size-3 hyperarcs of the form u,v —>• w where u, v is called 
the body (or tail) and w is called the head of the liyperarc. Reachability is defined 
by a marking procedure known as forward chaining. A vertex w £ V is reachable 
from a set S C V if the following process marks w: start by marking vertices in S, 
and as long as there is a hyperarc a,b c such that a and b are both marked and 
c is unmarked, mark c as well. 

Given an undirected graph G = (V. E), we would like to find the minimal number 
of hyperarcs in a directed hypergraph H = (V,F), such that for every pair (u,v), 
the set of vertices reachable from {u, i>} in H is the whole vertex set V if (u, v) € E, 
and is {it, v} otherwise. In other words, given a set of hypergraph bodies (specified 
by a graph), we look for the minimal total number of heads assigned to these 
bodies such that every body can reach every vertex. We call the minimum the 
hydra nitra&erQH of G, denoted by h(G). 
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1.1. Motivation from propositional logic. The problem is a combinatorial re¬ 
formulation of a special case of the minimization problem for propositional Horn 
formulas. Horn formulas are a basic knowledge representation formalism. Horn 
minimization is the problem of finding a shortest possible Horn formula equiva¬ 
lent to a given formula. There are approximation algorithms and computational 
hardness and inapproximability results for this problem 0 [7l 0 [12]. Special cases 
correspond to the well studied transitive reduction and minimum equivalent di¬ 
graph problems for directed graphs. Estimating the size of a minimal formula is 
not well understood even in rather simple cases. A hydra formula ip is a definite 
Horn formula with clauses of size 3 such that every body occurring in the formula 
occurs with all possible heads. The minimal number of clauses needed to represent 
(p equals the hydra number of the undirected graph G corresponding to the bodies 
in ip. 

Besides being a natural subproblem of Horn minimization, the hydra minimiza¬ 
tion problem is also of interest for the following reason. The Horn body minimization 
problem is the problem of finding, given a definite Horn formula, an equivalent Horn 
formula with the minimal number of distinct bodies. There are efficient algorithms 
for this problem mm hque]. Thus one possible approach to Horn minimization is 
to find an equivalent formula with the minimal number of bodies and then to select 
as few heads as possible from the set of heads assigned to the bodies. This approach 
is indeed used in an approximate Horn minimization algorithm [7]. Hydras are a 
natural test case for this approach. 

The hydra property in Horn formulas is also of interest to relational database 
theory. It corresponds to the Boyce-Codd normal form, where every determinant 
set of attributes is a candidate key, that is, it either determines all attributes, or 
only trivial ones, i.e., attributes that are subsets of the determinant. 

1.2. Summary of results. The hydra number of a graph is defined in Section [2] 
where we also explain the Horn formula minimization problem and show, in Propo¬ 
sition [2T7J that the two minimization problems are indeed equivalent. For the rest of 
the paper we use the hypergraph terminology. In Section [31 we begin by observing 
that the hydra number of a graph is at least the number of edges and at most twice 
the number of edges, for every graph on at least three vertices. This determines the 
“playing field” for the hydra number. It is useful to consider the excess number of a 
graph, which is the difference of its hydra number and its number of edges. Graphs 
satisfying the lower bound, i.e., graphs with excess 0, are called single-headed , and 
deserve further study as the best behaving graphs for the hydra number. It turns 
out that single-headedness is related to, but is more general than Hamiltonicity. 
We also note that adding edges to a graph can only mildly increase its hydra num¬ 
ber (Proposition [33]) and we give some simple sufficient and some simple necessary 
conditions for single-headedness that we will apply later in the paper. 

Theorem 14.21 in Section [4] gives a general upper bound: it shows that the hydra 
number is at most the number of edges plus its generalized line graph path cover 
number p{G ), the path cover number of the line graph of any spanning subgraph 
with no isolated vertices. We give some bounds on the hydra number of discon¬ 
nected graphs, and discuss the connection to the total interval number of graphs. 
This connection is used to determine the maximal hydra number of a tree and the 
maximum excess number of a connected graph. Section [5] shows that the bound 
of Theorem 14.21 can be far from optimal. In Theorem 15.11 we construct a family 
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of single-headed graphs with a linear number of edges and show that none of its 
subgraphs has a Hamiltonian line-graph and that, in fact, the quantity p{G) is also 
linear. 

We then focus our attention on the hydra number of trees in Sections [6] and 
B We show that single-headed trees are precisely the stars and that trees with 
excess one are precisely the non-star caterpillars (Theorem 16.111 . We also prove a 
general lower bound for trees ^Theorem 16.211 by showing that the hydra number of 
a tree T is at least its number of edges plus \£{T)/ 2], where £(T) is the number 
of leaf vertices in the tree obtained from T by deleting its leaves. The statement 
generalizes to graphs with certain types of pendant trees. This lower bound is 
sharp and is attained by spider trees fCorollarv l7.Hl . In fact, spiders with an odd 
number of legs of length exactly two, also attain the upper bound of Theorem 021 
We conclude the discussion of spiders by considering the in-degree of vertices in 
optimal hydras representing a spider family fTheorem 17.41) . We end the section 
by considering binary trees: we show that the hydra number of a complete binary 
tree is between 11 E (T)| and E (T) |+1 fTheorem 17.611 . and demonstrate that a 
connectivity restriction on the spanning subgraph in Theorem 14. 21 can significantly 
worsen the upper bound on the hydra number even for trees (Theorem 17.71) . 

Finally, in Section 0] we consider the related problem of finding the minimal 
number of hyperarcs for which every fc-tuple of vertices reaches every vertex, and 
we give almost matching lower and upper bounds. 

Kucera in recent work m studied the complexity of deciding single-headedness, 
and proved that this problem, and by consequence the problem of calculating the 
hydra number of a graph, is NP-complete. He also introduced a subclass of trees 
for which the hydra number can be computed in polynomial time. 

2. The hydra number 

In this section we discuss the two approaches for arriving at the notion of the 
hydra number and their relationship. 

2.1. The directed hypergraph viewpoint. The closure cIh{S) of a set of ver¬ 
tices S with respect to directed hypergraph H is the set of vertices marked by the 
forward chaining procedure (described in the first paragraph of the paper) started 
from S. 

Definition 2 . 1 . A directed 3-hypergraph H = (V, F) represents an undirected graph 
G=(V,E)if 

i. (u,v) £ E implies cIh(u,v) = V, 

ii. (u,v) 0 E implies cIh{u,v) = {u,v}. 

Definition 2 . 2 . The hydra number h(G) of an undirected graph G = (V,E) is 
min{|F'|: H = (V,F) represents G}. 

These notions are illustrated by the following example. General bounds for the 
hydra number of complete binary trees are given in Section B 

Example 2 . 3 . Consider B 2 , the complete binary tree of depth 2. In Figure POI 
we see an example of a directed 3-hypergraph representing B 2 . The bodies of the 
hyperarcs are the edges of B 2 and the heads of the hyperarcs of H are pointed to by 
the arrows. Proposition^. 7| shows that this is optimal, and thus h(B 2 ) = 7. 
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Figure 1. H: a directed hypergraph representing B 2 . 

Remark 2.4. The removal of an isolated vertex decreases its hydra number by 
one.For the rest of the paper we assume that graphs contain no isolated vertices. 

2.2. The Horn formula viewpoint. A definite Horn clause in propositional logic 
is a disjunction of literals where exactly one literal is unnegated. Such a disjunction 
can also be viewed as an implication; for example the clause x V y V z is equivalent 
to the implication x,y —»• z. The tuple x, y is the body and the variable z is the 
head of the clause. The size of a clause is the number of its literals. A definite 
d-Horn formula is a conjunction of definite Horn clauses of size d. A clause C is 
an implicate of a formula p if every truth assignment satisfying p satisfies C as 
well. The implicate C is a prime implicate if none of its proper subclauses is an 
implicate. 

Implication between a definite Horn formula p and a definite Horn clause C 
can be decided by the forward chaining procedure. Reformulated for propositional 
logic, it marks every variable in the body of C, and while there is a clause in p 
with all its body variables marked and its head unmarked, it marks the head of 
that clause as well. Then p implies C iff the head of C gets marked. 

Definition 2.5. A definite 3-Horn formula p is a hydra formula, or a hydra, if for 
every clause x,y —» z in p and every variable u, the clause x,y —>• u also belongs to 

p. 

For example, the formula 

(x, y —» z) A (x, y —> u) A (x, z —> y) A ( x , z —>• u) 

is a hydra formula. Redundant clauses like x, y —> x are omitted for simplicity. 

Definition 2.6. The Horn formula minimization problem for hydras or hydra 
minimization problem is the following: given a hydra formula p, find an equivalent 
Horn formula with a minimal number of clauses. 

The following proposition is central to this study as it establishes that every 
prime implicate of a hydra is a clause occurring in the hydra itself, a fact that does 
not hold for definite 3-Horn formulas in general. Thus minimization for hydras 
amounts to selecting a minimal number of clauses from the hydra that are equivalent 
to the original formula. 

Proposition 2.7. Every prime implicate of a hydra formula p is a clause of p. 

Proof. First note that all prime implicates of a definite Horn formula are definite 
Horn clauses m ■ Consider a hydra p and a definite Horn clause C. If the body 
of C is of size 1, or it is of size 2 but it does not occur as a body in p then forward 
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chaining cannot mark any further variables, thus C cannot be an implicate. If 
the body of C has size at least 3 then it must contain a body x , y occurring in ip, 
otherwise, again, forward chaining cannot mark any further variables. But then 
the clause x, y —> head(C) occurs in ip and so C is not prime. □ 

Proposition ^.7l implies that the minimal formula size of a hydra ip and the hydra 
number of the undirected graph G formed by the bodies in ip are the same. 

Remark 2.8. For the remainder of the paper we use hypergraph terminology. 

3. First bounds on the hydra number and single-headedness 

In this section, starting with an easy bound for the hydra number of any graph 
with no isolated vertices, we define a class of graphs that provide the best behaving 
hydras, and give some preliminary results about their structure. 

Proposition 3.1. For every graph G = (V,E) with at least three vertices 

\E(G)\<h(G)<2\E(G)\. 

Proof. For the upper bound construct a hypergraph of size 2| E{G) | by first ordering 
the edges of G, and then using each edge as the body of two hyperarcs whose heads 
are the two endpoints of the next edge in G. For the lower bound, note that each 
edge of G must be a body of at least one hyperarc. □ 

Equality holds in the upper bound if and only if G is a matching. Indeed, if G 
is a matching then every edge must occur as the body of at least two hyperarcs as 
otherwise forward chaining cannot mark any further vertices. If G not a matching, 
it must contain a path of length two. Let (u, v ) and ( v , w) be the two edges of the 
path. Order the edges of G so that (u, v ) and (v, w) appear consecutively. We can 
modify the proof above noting that (it, v) only needs the single head w. This gives 
a hypergraph representing G with 2\E(G)\ — 1 hyperarcs. 

In view of the lower bound one can consider the excess of a graph to be h(G ) — 
|E(G)|. Graphs of excess 0 are of particular interest as they represent “most com¬ 
pressible” hydras. 

Definition 3.2. A graph G = (V,E) is single-headed if h(G) = \E(G)\. 

In other words, a graph is single-headed iff there is a hypergraph H = ( V , F) 
such that every edge of G has exactly one head assigned to it, every hyperarc body 
in H is an edge of G and every pair forming an edge of G has the entire vertex 
set as its closure. Cycles, for example, are single-headed, as shown by the directed 
liypergraph 

( 1 ) (vi,V 2 -> V3), (V2,V3 -A- Vi), ■■■, (Vk-l,Vk Vi). 

Adding edges to a cycle preserves single-headedness. For example, the graph 
obtained by adding edge (vi, vj) is represented by the directed hypergraph obtained 
from ([I]) by adding the hyperarc Vi,Vj —> , where i + 1 is meant modulo to. 

Thus we obtain the following proposition. We will discuss stronger forms of this 
statement in the next section. 

Proposition 3.3. Hamiltonian graphs are single-headed. 
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We call a body (u,v) single-headed (resp., multi-headed ) with respect to a di¬ 
rected hypergraph H representing a graph G if it is the body of exactly one (resp., 
more than one) hyperarc of H. 

Remark 3.4. Assume that the directed hypergraph H = (V. F) represents the graph 
G = (F, E ) and |F|> 4. If u , v —► w £ F and u, v is single-headed in H then w must 
be a neighbor of u orv. Indeed, otherwise cIh{u,v ) = {u,v,w} C V. This is a fact 
which we use numerous times in the proofs without referring to it explicitly. 

The following proposition generalizes the argument proving Proposition 13.31 

Proposition 3.5. Let G be a graph with no isolated vertices and let G' be a span¬ 
ning subgraph of G with no isolated vertices. Then 

h(G)<h(G') + \E(G)HE(G')\. 

If G’ is single-headed then G is also single-headed. 

Proof. Let H' be a directed hypergraph of size h(G') representing G'. Since G' 
is a spanning subgraph of G and contains no isolated vertices, for every edge 
(u,v) £ E(G) \ E(G r ) there is an edge (v,w) £ E{G'). The directed hypergraph H 
representing G obtained from H' by adding the hyperarc u,v —> w to H' for each 
edge (u,v) £ E(G)\E(G') satisfies the requirements. The second statement follows 
trivially. □ 

It may be interesting to note that if we start with a matching on n vertices and 
keep on adding edges until the graph becomes a Hamilton cycle, the hydra number 
does not increase, each graph has hydra number n. We do not know if the hydra 
number can go down after adding an edge. 

A second proposition gives a sufficient condition for single-lieadedness based on 
single-headedness of a non-spanning subgraph. This will be a crucial construction in 
showing that single-headed graphs can have quite complex structure in Theorem l5.ll 

Proposition 3.6. Let G be a connected graph and (u,v) qL E{G). Construct 
the graph G with vertex set V(G) = F(G) U {«;} and edge set E(G ) = E(G) U 
{(u,v), for some w qL F(G). If G is single-headed then G is single-headed. 

Proof. Let H be a directed hypergraph representing G and containing exactly 
\E(G) \ hyperarcs. Construct H from H by adding hyperarcs u,v —>• w and v,w —»• z, 
where z is a neighbor of v in G guaranteed to exist by the connectivity of G. Since 
all pairs in E(G) reach both u and v in H (and in H), hyperarc u,v —>• w ensures 
all pairs in E(G) can reach in H the new variable w as well. On the other hand, 
hyperarc v,w —> z ensures that the new pairs (it, v) and (v, w) can reach all other 
variables. Finally, there are \E(G)\ hyperarcs in H. □ 

Next we see a simple sufficient condition for a graph not to be single-headed. 

Proposition 3.7. Let G be the union of two disjoint subgraphs G\ = (Fi, E \) and 
G 2 = (V 2 ,E 2 ), connected by a cut-edge. If both G 1 and G 2 contain at least two 
vertices then G is not single-headed. 

Proof. Assume that G is single-headed and let H be a directed hypergraph demon¬ 
strating this. Let u £ G\, v £ G 2 , and (u,v) be the cut-edge. There is exactly 
one hyperarc of the form u,v —>• z in H. If z is in G\ (resp., in G 2 ) then forward 
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chaining started from z,u (resp., z,v) cannot mark any vertices in G 2 (resp., G 1 ) 
other than v (resp., u ). □ 

4. Line graphs and a hydra number upper bound 
This section contains an upper bound to the hydra number of a general graph. 

4.1. Hydra number upper bound. The line graph L{G) of G has vertex set 
V(L(G)) = E(G) and edge set E(L(G)) = {(e,/)|e ^ f e E(G) and e n / ± 0}. 

A (vertex-disjoint) path cover of G is a set of vertex-disjoint paths such that every 
vertex v £ V is in exactly one path. The path cover number of G , pcn{G), is the 
smallest integer k such that G has a path cover containing k paths. Also recall that 
a spanning subgraph G' of G is a subgraph with vertex set V(G') = V(G) and edge 
set E{G') C E(G). We note that a spanning subgraph need not be connected. 

In Proposition 13.31 we noted that Hamiltonian graphs are single-headed. The 
second part of Theorem 14.21 extends this to show that Hamiltonicity of the line 
graph of G, and furthermore, Hamiltonicity of the line graph of a subgraph of G 
with no isolated vertices, is also sufficient for single-headedness. 

Remark 4 . 1 . Hamiltonicity of L(G) is a strictly weaker condition than Hamiltonic¬ 
ity of G. In, particular Hamiltonicity of G is easily seen to imply Hamiltonicity of 
the line graph, and a triangle with a pendant edge shows that the converse fails. 

Furthermore, the number of edges plus the path cover number of the line graph 
of any spanning subgraph with no isolated vertices gives a general upper bound 
for the hydra number. Thus, for a graph G we define p(G) to be the following 
parameter of the graph: 

p{G) = min {pcn{L(G')) : G' is a spanning subgraph of G with no isolated vertices} 
Theorem 4 . 2 . Let G be a graph with no isolated vertices. Then 

h(G) < \E(G)\+p(G). 

In addition, if there exists a spanning subgraph G' with no isolated vertices such 
that L{G') is Hamiltonian then G is single-headed. 

Proof. By Proposition 13.51 it is sufficient to prove the bounds for an appropriate 
subgraph G'. 

Let k = p(G), {Pi]\ be the minimum path cover of L{G') for some spanning 
subgraph G' with no isolated vertices and h be the number of vertices of the path 
Pi. Direct the edges of each path Pi so that Pi is a directed path. Let e* = (a2/») 
and fi = (Ui,Vi ) be the first and last edges in Pi, respectively (if Pi is a single 
vertex then d = fi). 

We construct a directed hypergraph H representing G' and satisfying the re¬ 
quirements as follows. First, for each path Pi of at least 2 vertices we add k — 1 
hyperarcs: for each directed edge (e, /) £ Pi, where e = (u,v) and / = (v,w), add 
a hyperarc u,v —A w to H. 

If A = 1 then we complete the construction of H by adding two hyperarcs, 
u\,Vi -A X\ and U \, iq —> y\. If k > 1 then we complete the construction by adding 
the 2k hyperarcs 

(' Uk,Vk x\),(uk,Vk ->■ y\) and (ui,Vi ),(ui,Vi ->• yi+i), 
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for 1 < i < k — 1. It follows directly from the construction that the hypergraph H 
represents G'. 

For the second part of the theorem, suppose G' is a spanning subgraph with no 
isolated vertices such that L(G') is Hamiltonian. Let C be a Hamiltonian cycle in 
L(G'). Direct the edges of C so that C is a directed Hamiltonian cycle. The directed 
hypergraph H satisfying the requirements is constructed by adding a hyperarc 
u,v —» w for each directed edge (e, /) £ C, where e = (u, v) and / = (v, w). □ 

4.2. Disconnected graphs. In this subsection we add a few remarks on the hy¬ 
dra numbers of disconnected graphs. First note that the general upper bound of 
2|.E(G)|, which applies to all graphs, can be improved for connected graphs as 
follows: build a hydra for a spanning tree and then add additional single-headed 
edges using Proposition 13.51 Theorem 17.21 gives an essentially 5/4|H(G)| for the 
hydra number of trees, so this way one gets an upper bound of \E{G)\+\V (G)\/ & 
for connected graphs. This can be extended to disconnected graphs by adding two 
hyperarcs per connected component, pointing to the next connected component in 
a circular order. 

On the other hand, the following lower bound is easy to see for any graph G 
containing at least two connected components. 

Proposition 4.3. Let G consist of k > 2 connected components Gi, G 2 ,..., Gk for 
k >2, and let each Gi contain at least two vertices. Then h(G ) > \E(G)\+k. 

Proof. Consider a directed hypergraph H representing G. For each component Gi 
there must be an edge (uj, vf) £ G,; such that Ui, W; —>• w £ H for some w £ Gi, and, 
since w is not adjacent to Ui or v t in G, (ttj,Uj) cannot be single-headed in H. □ 

The relationship between the hydra number of a disconnected graph and the sum 
of the hydra numbers of its components is open. For single-headed components 
at least one multi-headed edge is needed, otherwise other components cannot be 
reached. Thus one can ask the following. 

Problem 4.4. Let G consist of k connected components Gi, G 2 ,..., Gk for k > 2, 
such that each Gi contain at least two vertices and s of the components are single¬ 
headed. Then does the following hold: 

k 

KG) = J2 h ( G *) + s - 

i=l 

It can be shown that the answer is positive when each Gi is single-headed or 
contains a spanning caterpillar tree. This is because Yhi =1 K@i) + s = \E(G)\+k 
by Proposition 13.51 and Theorem 16.11 

4.3. Hydras and the total interval number of a graph. Theorem 14.21 in 
relating the hydra number of a graph G with the path cover number of the line 
graph of its subgraphs, indicates a connection with another graph parameter, the 
total interval number of a graph, introduced independently by {§] and |2'D| . The 
total interval number of a graph G = (V, E ) is the minimum number of disjoint real 
intervals necessary in a multiple-interval representation of a graph; the latter is an 
assignment multiple disjoint real intervals to each vertex of G with the property 
that two vertices are adjacent in G if and only if their assigned sets intersect. 
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In [14], it was shown that for a triangle-free graph G, the total interval number 
of G, t(G) = \E(G)\+pcn(L(G)). The same result follows from p~7] . Since clearly 
p(G) < pcn(L(G)) and by Theorem 14.21 the hydra number of G is at most the total 
interval number of the graph establishing the following corollary. 

Corollary 4.5. For a triangle-free graph G with no isolated vertices h(G) < t(G). 
In addition, for a graph G containing a triangle-free subgraph G' with no isolated 
vertices, h(G ) < r(G'). 

In Section rm we use this corollary to bound the maximum hydra number of a 
graph. 


5. Single-headed graphs with large p(G) 

In this section we construct a family of sparse single-headed graphs with large 
generalized line graph path cover number. This shows that the upper bound of 
Theorem 14.21 can be off by a constant factor, and that the condition of the second 
half of the theorem is not necessary for single-headedness. 

Theorem 5.1. There is a family of single-headed graphs {Gk,k > 2}, where Gk 
has @(fc) edges and p{Gk) = 0 ( k ). 

Proof. Consider the sequence of graphs Gk,k> 2 constructed as follows. Starting 
from an 8fc-cycle, with vertices vq, . .., Vsk-i, and pendant edges XiVu and yiV^k+u 
for 0 < i < k — 1, add a vertex z t and the edges (xi,yf), (y^, zf), for each i such that 
0 < i < k — 1. Recall the construction in Proposition 13.61 



Figure 2. Single-headed graph Gk for k = 4 from Theorem 15.11 
(left) and its line graph (right). The arcs on the line graph indicate 
of a smallest-possible directed hypergraph realizing the hydra. 

By Proposition 13.61 Gk is single-headed, since a cycle with attached pendant 
edges has a Hamiltonian line graph. We will show that for an arbitrary, not nec¬ 
essarily connected, spanning subgraph G' C Gk the path cover number of L(G') is 
at least fc/4. 
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Define Di to be the set of vertices in the ?'th diagonal of L(G), namely XiVa, 
Xiyi, yiZi, and yiV 4 k+ 4 i- Consider an arbitrary path cover S = {Pj : 1 < j < s} of 
the vertices of L(G"). 

Lemma 5.2. Let Di = Di n V(L(G')), and let G[Df] be the subgraph of L(G') 
induced by Df. If G[Df] does not contain an endpoint of a path in S, then Df = Di 
and one path in S covers all vertices in Di. 

Proof. Suppose that G[Df} does not contain an endpoint of a path in S, and assume 
for contradiction that e £ Di\ Df. Since G' is spanning and contains no isolated 
vertices, it must contain the edge (jji, Zi ), and soe / yiZi. Also e 0 {xiyi, yii’ 4 k+ 4 i}, 
else yiZi would be a degree-1 vertex in L(G'), and thus it would be an endpoint of 
a path in S. Furthermore e ^ XiV 4 i, otherwise S must have a path endpoint in the 
triangle {xiyi, yiZi, yiV 4 k+ 4 i}. Thus Di is contained in the vertex set of L(G'), and 
due to the structure of the diagonal and the assumption that no path endpoints of 
S fall in Di , all vertices in the diagonal are covered by exactly one path P of S. □ 

Define A f to include all vertices in Di along with the cycle vertices V 4 i- 3 V 4 i- 2 , 
V4i-2V4i-i, V4i-iV4i, V4iV4i+i, U4i+iU4i+2, V4i+2V4i+3, and their antipodes on 
the circle V4k + 4i-3 V4k+4i-2, V4k+4i-2V4k+4i-l, V4k+ii-l v ik+ii-, V4k+4il’4k+4i+l, 
T4fc+4i+1^4fc+4i+21 V4k+4,i+2l’4k+4i+3 ■ 

Let Xf = Xi fl V(L(G')). We claim that the subgraph G[A'/] induced by the 
vertex set X / contains at least one endpoint of a path in S. Suppose not. By 
Lemma [5.21 all vertices in Di are in L{G’). A case analysis shows that all other 
vertices in Xi must be present, otherwise a degree-1, or degree-0 vertex is introduced 
in G[Xi] or G' is not both spanning and without isolated vertices. Indeed, deleting 
two consecutive vertices along the cycle Vi-ii’i and ViVi+i would isolate vertex Vi in 
G', and deleting any one vertex VjUj+i, or any two non-consecutive vertices, along 
the cycle would make at least one of Vi-\Vi and 'ty + i'ty + 2 a degree-0 or degree-1 
vertex in G[A/]. Thus there must be a path P in S going through all the vertices 
of Xi, and by hypothesis it has no endpoints in X t . But this is not possible as no 
such path can include all three of the vertices V 4 i-\V 4 i, xpyi, and U 4 i+ 2 U 4 i+ 3 . 

There are k/2 disjoint sets Xf and so there are at least k/ 4 paths in S. □ 

6. The hydra number of trees 

In this section we obtain bounds for the hydra number in the special case of 
trees. This is a natural subclass to consider. Furthermore, Proposition 13.51 shows 
that the excess of a graph does not increase by adding edges and thus trees actually 
give examples of graphs with maximal excess. 

6.1. Trees with Low Hydra Number. We begin the discussion of the hydra 
number of trees, with trees having excess 0 or 1. 

A star is a tree that contains no length-3 path. A caterpillar is a tree for which 
deleting all vertices of degree one and their incident edges from the tree gives a 
path. We call this path the spine of T, and note that it is unique. A useful 
characterization of caterpillars is that they do not contain the subgraph in Figure [3] 
m (see also [21] p.88]). 

Caterpillars have been instrumental in Em where finding maximal caterpillars 
starting from the leaves of the tree was the basis for a polynomial algorithm used 
to find a minimum Hamiltonian completion of the line graph of a tree (which is the 


HYDRAS: DIRECTED HYPERGRAPHS AND HORN FORMULAS 


11 


A 

Figure 3. The forbidden subgraph for caterpillars. 


same as finding a minimum path cover). A linear algorithm was later put forth by 
[II for the same problem. For general graphs the problem is NP-hard. Furthermore, 
[B] proves that finding a Hamiltonian path is NP-complete even for line graphs. 

Stars are the only trees that are single-headed, and caterpillars are the only 
non-star trees that can attain h{T) = |i£(T , )|-|-l. 

Theorem 6.1. Let T be a tree. Then 

i. h(T ) = |i?(T)| if and only ifT is a star, 
ii. h(T) = |1?(T)|-|-1 if and only ifT is a non-star caterpillar. 

Proof. First we prove the upper bounds. The single-headedness of stars is easily 
seen directly, or follows from Theorem l4.2l For caterpillars, the upper bound follows 
from Theorem 14.21 as the line graph of a caterpillar contains a Hamiltonian path. 

For the lower bounds, note that if a tree is not a star then it contains a path of 
length three. The middle edge of the path is a cut-edge between two components 
of at least two vertices, hence Proposition 13.71 implies that the tree is not single¬ 
headed. The lower bound for non-star caterpillars follows from Theorem 16.21 in the 
next subsection. □ 

6.2. Lower Bounds for Trees. Let T~ be the tree obtained from T by removing 
all leaves, and define £(T) to be the number of leaves in T~. 

Theorem 6.2. For a tree T that is not a star, 

h{T)>\E{T)\+\i(T)/2]. 

Proof. If T is the tree consisting of a central edge ( u , v) and pendant edges attached 
to both u and n, then ( u , v ) is cut edge. By Proposition 13.71 T is not single-headed 
and the bound h{T) > \E(T)\+ \£{T)/ 2] = |i?(T)|-|-l holds. 

So we may assume that T is not that tree. Suppose that H is a hypergraph 
representing T. Let the leaves in T~ be ni,... ,vt, where i = £{T), and let Ui be 
the unique neighbor of Vi in T~. Note that by the assumption on T, no vertex Ui 
can be identical to any of the leaves Vj. Define T Vi to be the subtree of T rooted 
at Vi, containing Vi and its leaf neighbors in T. 

We construct an injection / by uniquely associating with each subtree T Vi a 
hyperarc of H having a multi-headed body. Let Heads(u, v) denote the set of heads 
of hyperarc in H having body ( u , v). 

Step 1: Consider all subtrees T Vi containing a multi-headed body ( Vi,w ) of H. 
If the set Heads(ui,w) contains a vertex x ^ U j^iT Vj then define 

(2) f(i) := (vi, w —>• x). 

Assume that Heads(ui, in) C U j^iT Vj . The set {vi, in}UHeads(ni, in) must contain 
an edge other than ( Vi,w ), otherwise forward chaining cannot mark any further 
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vertices. Thus there must be at least one other subtree T v . for j ^ i containing two 
heads of ( Vi,w ). Let the two heads be x\ and X 2 and define 

(3) f(i) := (vi,w ->• an), f(j):=(vi,w^x 2 ). 

The hyperarcs selected in this step are all distinct. 

Step 2: Consider all subtrees T Vi for which / is undefined after Step 1 such 
that ( Ui,Vi ) is multi-headed in H. Again, if the set Heads(uj, in) contains a vertex 
x Uj^iT Vj then define 

(4) f(i):=(m,Vi^x). 

Note that the body () has not been used so far and so / remains injective. 

Assume that Heads(it*, vi) C \Jj^ t T v ,. The body ( Ui,Vi ) has to reach the leaves 
of T Vi , hence there must be hyperarcs with body outside T Vi and having a leaf of 
T Vi as their head. The body of such a hyperarc must be multi-headed as otherwise 
forward chaining could not continue. Let x,y —> w be any such hyperarc and put 

(5) f(i) := (x,y —»• w). 

The partial mapping is still injective, as hyperarcs of type © and 0 have heads 
outside the subtrees, and hyperarcs of type © have heads in subtrees for which / 
is defined in Step f. 

Step 3: Consider all subtrees T Vi for which / is undefined after Steps 1 and 2. 
For these subtrees it holds that edge (m, Vi) and edges ( Vi,w ), where in is a head in 
T Vi , are all single-headed in //. Also, ( Ui,Vi ) must have its only head outside T Vi . 
else no edge in the subtree T Vi can reach any vertex outside T Vi . Indeed, if there is 
a hyperarc Vi,w —>• x, where in is a leaf of T Vi and x $ T Vj , then the body (ty, in) 
can either reach Ui only (if x = Ui) or no vertex other than its head. 

Repeating the argument in the second half of Step 2, there must be a multi¬ 
headed hyperarc x, y —> w with body outside T Vi and having a leaf of T Vi as its 
head. Then we define 

(6) f(i) := x,y ->w. 

The partial mapping is still injective, as hyperarcs of type 0 and 0 have heads 
outside the subtrees, and hyperarcs of type @ and © have heads in subtrees for 
which / is defined in Steps 1 and 2. 

Thus H contains at least £ hyperarcs having multi-headed bodies. Let these 
hyperarcs have k different bodies. Then the excess of H is at least k, as every 
multi-headed body contributes at least one to the excess, and at least i — k, as the 
k edges altogether contribute at least i — k to the excess. Thus the excess is at least 
ma x(k,£~k)> f^/2~|. □ 

Theorem 16 . 21 can also be formulated for general graphs containing outer induced 
subgraphs that are trees (i.e., graphs that contain some non-leaf vertices that be¬ 
come leaves after all their neighbors of a degree-1 are deleted). The proof is essen¬ 
tially the same. 

Corollary 6.3. Let G~ be the graph obtained from G by removing all degree-1 
vertices, and define £(G) to be the number of degree-1 vertices in G~. If £(G) > 1 
then 


h(G) > \E(G)\+ \1{G )/21 . 


HYDRAS: DIRECTED HYPERGRAPHS AND HORN FORMULAS 


13 


7. Special classes of trees 

7.1. Spiders. A spider is a tree with at most one vertex of degree at greater than 2. 
Let u be the vertex with degree at least 3. For convenience, we call the unique path 
from a leaf of a spider to u, including the endpoints, a leg. The upper bound from 
Theorem 14.21 is sharp for spider trees (even if we require the subgraph G' to be 
connected) and so is the lower bound from Theorem 16.21 

Corollary 7.1. If T is a spider tree with I legs of length at least two, then 

h{T) = \E(T)\+ \l/2\ . 

Proof. The upper bound is a corollary of Theorem 14.21 as L(T) can be covered with 
\l/2\ paths obtained from pairing up the legs of the tree and taking their line 
graphs. The lower bound is a corollary of Theorem 16.21 since I = £(T). □ 

Let Tfe to be the spider tree with k legs of length exactly 2. 

Theorem 7.2. The maximum hydra number of a tree T is L(5|V r (T)|—3)/4j, and 
this is attained by the trees Tk- 

Proof. The maximum interval number of tree is |_(5|C(G)| —3)/4j (shown in [2]). 
By Corollary 14.51 h(T) < r(T) and the same upper bound is established for the 
maximum interval number of a tree. 

On the other hand recall that T% is a spider tree with k legs of length 2, 
\V(Tk)\= 2k + 1, \E(Tk)\= 2k and hydra number h{Tk) = \E(Tk)\+ \k/2] = 
L(5|F(T fe )|—3)/4j . □ 

Remark 7.3. Recall that Proposition 13.51 implies that any one edge added to T\ 
can increase the hydra number by at most one, and cannot increase the excess of 
the hydra number, the quantity (h(G) — |i?(G)| / ). Every connected graph contains a 
spanning tree, thus the maximum value of the excess h(G) — \E(G)\ for any connected 
graph G is attained by the tree Tj~. 

In the discussion of the hydra number we considered single-headed graphs, which 
imposes a restriction on the outdegrees of hyperarcs. Now we turn to the discussion 
of indegrees and consider the largest in-degree of a vertex in an optimal H. The 
following results show that the trees Tk introduced above have interesting properties 
with respect to the indegree of their central vertex: that vertex has large indegree 
in every optimal representation, and bounding its indegree results in a blow-up of 
the size of its representations. 

Theorem 7.4. The central vertex u is the head of at least |_fc/2j hyperarcs in every 
optimal hydra H representing Tk. 

Proof. Let u be the central vertex of Tk, Vi be one of the k = £(T) neighbors of u, 
and Wi be the leaf neighbor of ty. Consider an optimal hypergraph representing Tk 
with \E(Tk)\+ \k/2~\ hyperarcs. Either {vi,wf) is single-headed in H , in which case 
u must be its head, or it is multi-headed in H. By the optimality of H exactly at 
most |~A:/2] of these edges is multi-headed, and thus at least \_k/2\ of them must 
be single-headed. □ 

The trade-off between the indegree of the center and the size of representations 
can be formulated as follows. 
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Figure 4. Induction on the line graph L(B d ) from the proof of Theorem 17.61 


Theorem 7.5. For any k and any d, such that 1 < d < [fc/2_|, if H is a hypergraph 
representing Tk, such that the center u is a head of at most d hyperarcs, then the 
number of hyperarcs in H is at least \E(Tk)\+k — d. 

Proof. As before, each single-headed body ( Vi,Wi ) must have u as its head, thus 
at most d of these bodies are single-headed. Therefore at least k — d of them are 
multi-headed and so the excess is at least k — d. □ 

7.2. Complete Binary Trees. In this section we obtain upper and lower bounds 
for h(G) when G is a complete binary tree. A complete binary tree of depth d , 
denoted B d , is a tree with d + 1 levels, where every node on levels 1 through d has 
exactly 2 children. Bd has 2 d+1 — 1 vertices and 2 d+1 — 2 edges. 

Theorem 7.6. For d > 3 it holds that 

(7) l\E(B d )\<h(B d )<^\E(B d )\ + l. 

Proof. The lower bound follows from Theorem 16.21 since there are 2 d ~ 1 leaves in 
(. B d )~ (all vertices in the (d — 1) level of the tree). 

For the upper bound, recall that p(T) is the minimum path cover number of 
L(T') over all subgraphs T' with no isolated vertices. We show that there is a 
spanning subgraph B' of B d with no isolated vertices with a path cover containing 
at most [(2/15 )\E(B d )\] paths. Specifically, we delete every fourth level of edges in 
the binary tree, leaving a forest of B^s. This corresponds to deleting every fourth 
level of vertices in L(B d ), to obtain L(B'), and gives 2 paths in L(B d ) for every 15 
edges in the tree. (See Figure [4]) □ 

Recall that pcn{G) is the path cover number of graph G, and let g{G) = 
pcn(L(G)). The following theorem demonstrates the significance of considering 
non-connected subgraphs of G when upper-bounding h(G) in Theorem 14.21 It 
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demonstrates that even for trees, there are examples where the quantity g(G) is 
significantly larger than the quantity p[G). 

Theorem 7.7. Ford large, 

g(B d )-p{B d ) = QQV{B d )\). 

Proof. Applying the algorithm of EH the path cover number of L(Bd) is g(Bd) = 
\\E(Bd)\ /7]. This is because this algorithm calculates the trail cover number of 
a tree, which by E3 Theorems 3 & 6] is equivalent to the minimum number of 
caterpillars required to cover the edges of a tree, and the path cover number of the 
line graph. One can construct an optimal caterpillar edge cover from the bottom 
of the tree, and from left to right, giving one caterpillar, and equivalently one path 
in the path cover of L{Bd), for every 7 edges. □ 

It would be interesting to close the gap between the bounds in Theorem 17.61 
The lower bound takes only the bottom part of the tree into consideration and thus 
it is not expected to be sharp. It is an open question whether the upper bound 
\E(T)\+p(T) is sharp for trees in general. 

8. A RELATED MINIMIZATION PROBLEM 

There are several ways in which one can generalize hydra numbers. One can 
consider hydra numbers for hypergraphs, where the bodies of the hyperarcs contain 
more than two vertices and so they form a hypergraph instead of a graph. Studying 
hydra numbers of hypergraphs is an open area of research. In this section we 
consider a second generalization of hydra numbers. 

Given n and a number k (2 < k < n — 1), let f(n , k) be the minimal number of 
hyperarcs in an ?r-vertex 3-uniform directed hypergraph H such that the closure of 
every ^-element subset of the vertices is the whole vertex set. The case k = 2 is 
just the hydra number of complete graphs and so /(n, 2) = Q). 

We use Turan’s theorem from extremal graph theory (see, e.g. EH). The Turan 
graph T[n,k — 1) is formed by dividing n vertices into k — 1 parts as evenly as 
possible (i.e., into parts of size [n/(k — 1)J and \n/(k — 1)]) and connecting two 
vertices iff they are in different parts. The number of edges of T(n, k — 1) is denoted 
by t(n, k — 1). If A; — 1 divides n then 



Turan’s theorem states that if an n-vertex graph contains no /c-clique then it has 
at most t(n,k — 1) edges and the only extremal graph is T(n,k — 1). Switching 
to complements it follows that if an n-vertex graph has no empty subgraph on k 
vertices then it has at least Q) — t{n , k — 1) edges. 

Theorem 8.1. If k < (n/2) + 1 then 

(2) _ k ~ ^ - ^ n ’ k ) - (2) ~ _ “ 1 )‘ 

Proof. Suppose H is a 3-uniform directed hypergraph with all fc-tuples having the 
whole set as their closure. Then every fc-element set S of vertices must contain 
at least one body of a hyperarc in H , otherwise forward chaining started from S 
cannot mark any vertices. Thus the undirected graph formed by the bodies in H 
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contains no empty subgraph on k vertices, and the lower bound follows by Turan’s 
theorem. 

For the upper bound we construct a directed hypergraph based on the comple¬ 
ment of T(n, k — 1) over the vertex set {aq, • ■ ■, x n }, consisting of k— 1 cliques of size 
differing by at most 1. Assume that each clique has size at least 3. In each clique do 
the following. Pick a Hamiltonian path, direct it, and introduce hyperarcs as in 0 
(with the exception of the last edge closing the cycle). For every other edge (u,v), 
introduce a hyperarc u,v -A w where w is a vertex on the Hamiltonian path that is 
adjacent to u or v. For each edge e closing a Hamiltonian cycle, add two hyperarcs 
with body e, and heads the endpoints of the first edge on the Hamiltonian path of 
the next clique (where ‘next’ assumes an arbitrary cyclic ordering of the cliques). 
For cliques of size 2 the single edge in the clique plays the role of the unassigned 
edge and the construction is similar. □ 
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